CDS
Accession Number | TCMCG065C17175 |
gbkey | CDS |
Protein Id | XP_012701311.1 |
Location | complement(join(10929028..10929525,10929993..10930120,10930392..10930488,10930574..10930960,10931099..10931284,10937393..10937670,10937814..10937940,10938129..10938325,10938500..10938547,10938614..10938647,10938720..10938848,10939482..10939577,10940024..10940090,10940476..10940531,10941359..10941454,10941910..10942960,10943708..10943892)) |
Gene | LOC101762866 |
GeneID | 101762866 |
Organism | Setaria italica |
Protein
Length | 1219aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA207554 |
db_source | XM_012845857.2 |
Definition | DNA mismatch repair protein MSH7 isoform X1 [Setaria italica] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGCAGCCGCGGCGGCAGCAGCAGCAGTCCATCCTATCGTTCCTCCAGAAGCCGCCACGGGATCCCGCCGGCGCCGGGGAGGGCACGCCCCCTGAGAAGCCCCCGCGCCCGCCCGCGGGGTCGGTCGCGGGCATCATGGAGAGGCTCGTGCGCCCGCCACCGCGGCCGCAGCCGCCGCAGGGGAGCAGAAATCAAGATGCTTCCCAGGCTGGACATTTCAGTGGGAAAACCTTGCCTGGAAGAATTCGTGTTCCCTCAGATGGGCACTCAAGCGCATTGTCCTCAGGTTCCTGGAACGGTGAATACGGCAGAGCAACCATGTTTCCAAAACAAGGTTCGGGTATCATTCCCTCACAGGAGCCCCAGAAGTACCCATTGAGGTCCTCCACTGATGAATTTGTTCAAGCAAGCTCACTGGTCCCAGAATTTGGGCCAAATCAGACTCCTCTTCAGGCTAGATCCTTATTTGAAGACTTTGATGTACAAACACCTTCACAGGTTTCCTCAAAGAAAGTCTTCCTGGGGCCTGCTCATGGAGCTGATACACCTCTAACAGAATCCGGTTCAGATCGAACTCATTTACAGCATTCAGCAAAGAAGTTCTCATTGGTTTCTGCTAATGATGAATATACTAGAGCAGCGACAACCTTTGTGCTGAATTCAAATGATACTCGTACAGAGGAACATTTAAATAAGCTATGCCCAGGGTCTTCAGATCCTTTGTATATTAAAGCAACAAATTTGTTTGCAGAATTCGAGGCAAATGCAACTCCATTGAAGAATCACTCGAAAAATTCCTCCCTTCTTATGAATGATAAACATATTGGAGCAGCTGCTACTATATTTCCAGAACTTGATTCTAGTCCTCTGAAACCAGAAACGCCAGCAATGCGAGCAGTCATTCCTCGCCTGAAGCGAGTTCAAGAAGAACAAGGTGTGGCTGCCAACAAACCATGCTCTCCTTTGTGGGTCTCGAACAAGAAGATGAAATCAGCTAATTGTTCTCCTATTGAGAAGAAGGATCGTGATGAAATGGCTGATAGTGCGCGTAGGAAGTTTGAGTGGCTGAATCCATCTACCATCAGGGATGCAAATAGAAGGCGTCCAGATGATCCACTTTATGACAAGAGTACACTTTTTATTCCACCTGATGCATTGAGAAAGATGTCAACATCTCAAAAGCAATACTGGAATATTAAGTGTAAATATATGGACGTTGTCCTCTTTTTCAAAGTGGGAAAATTTTATGAGCTCTATGAGCTAGATGCTGAGATTGGCCAGAAAGAACTTGACTGGAAAATGACTGTTAGTGGGGTGGGCAAGTGCCGACAGGTTGGCATTTCAGAAAGTGGGATAGATGCTGCTGCTGATAAGCTTGTAGCTCGGGGGTATAAAGTTGGAAGAATAGAGCAAATGGAATCTGCAAACCAGGCCAAAGCTAGAGGATCAAATGCAGTTATTGAAAGAAAGCTGCTTAATGTGTCCACACCGTCGACTGCAGTTGATAGCAACATTGGTACGGATGCTGTTCACCTTCTTGCACTGAAAGAGGTTACCCTATCTTCTAGTAGTTCTCGGGTCTATGGATTTGCTTTCCTAGACTATGCTGCTCTTAAAATTTGGGTTGGATCACTCCATGATGATGATTCGTCTGCAGCTTTGGGGGCTTTGTTGGTGCAGGTTTCTCCAAGAGAAATAATCTATGAAACCTCAGGCCTCTCAAAAGAAACTCATAAAGCGATCAGAAAATATGCCTCAGCAGGATCTGTGAAGATGCAGCTGACCCCCCTACCTGGGATAGATTTCTCTGATGTTTCACAAATTCGAATGTTAATACATTCGAAAGAGTACTTTACAGCATCAGCAGAGTCGTGGTTATCTGCTTTGGATTGTGCATTGAATCGAGATGCAATTATTTGTGCACTTGGTGGACTTATTGGTCATTTGACTAGGCTCATGTTACATGATGCCCTGAAAAATGGGGAAGTCTTATCATACCACGTGTACAAAACCTGTCTAAGGATGGATGGTCAAACTCTTGTGAACCTTGAGATTTTCAGCAACAATTTTGATGGCGGTTCATCAGGTACTTTATATAAGCACCTTAATCAATGTGTCACAGCATCTGGTAAGAGGCTGCTAAGAAGGTGGATTTGTCATCCGCTTAAGGACATTGATGCTATCAATAAAAGGCTCGATGTTGTTGAGGCCTTCATCCAAAACTGTGGACTGGGCCCTACAACACTTGGATATCTCCGCAAAATTCCTGATCTTGAGAGATTGTTAGGACAAGTTAAATCTACTGTTGGATTATCATCTTCAATTCAATTGCCGTTTGTTGGAGAAAGGATATTAAAGAAACGGATCAGAACATTTATAATGCTTATCAACGGCCTCCGGAATGGACTTGATTTACTAAATGACTTACAAAGAGCTGATCATGGTGTATCAGCACTTTATAAGGTTGTAGAGATTCCAACATTGAGCTCCCTTCATGAATTGATCCATCAATTTGAGAAGAGAGTACAAGAGGAATTTCCATGTTACCAGGATCTTGGTGTCGAAGATAGTGATGGCGACACTTTGGCTCTTCTAGTGGGACTTTTTGTTAGAAAGGCTTCTGAATGGTCTTTAGTGATCAATGCTGTGAGCACTATTGATGTGCTTAGGTCCTTTGCTGCAATGACATTGTCATCATTCGGCACCATGTGCAAACCACACATTCTACTGAAAGATGATGTGCCTATACTTCGGATGAAGGGTCTATGGCATCCCTATGCTTTTGCAGAAAGTGCAAACGGGTTGGTACCAAATGATTTAACCCTTGGTCAGGATTTATCTGGCTTCAATCGTTTTGCGTTGTTGTTGACTGGTCCAAATATGGGTGGGAAATCTACAATGATGCGTGCCACCTGCCTGACTATTGTGCTTGCCCAGCTTGGCTGTTATGTCCCCTGCACATCCTGTGAATTGACCCTTGCAGATTCCATCTTTACACGTCTTGGTGCAACAGATCGGATTATGTCTGGAGAGAGCACATTTCTTGTTGAATGTACAGAGACTGCTTCTGTTCTTCAGAATGCAACTGAGGATTCTCTTGTATTGCTTGATGAGCTTGGCAGAGGAACTAGCACATTTGATGGATATGCAATTGCGTATGCTGTGTTCCGGCACCTCGTGGAGCAGGTGCGATGCCGTCTGCTCTTCGCCACTCACTACCACTCTCTGACAAAGGAGTTCGCCTCCCACCCTCACGTGAGCCTCCAGCACATGGCCTGCATGTTCAGAGCAAGGAGCGGCGCCCATGATGTCAATGGCGAGAAGGAGCTCACCTTCCTCTACCGTCTTGCCTCAGGGGCCTGTCCAGAGAGCTACGGCCTACAGGTCGCCACAATGGCAGGGATTCCGAAGTCGATAGTGGACAAGGCATCCGTCGCAGGCCAGGCGATGAGGTTGAAGATTGCCGGTAACTTCAAGTCCAGCGAAGAGCGGGCCGCGTTCTCAACCCAACATGAAGAGTGGCTGAGGACGGCCATGTCGGTCATCGTGAAGGACGGGCACCTAGATGAGGACATCATGGACACGTTGTTCTGCGTCTGCCAAGAGCTGAAGTTTCACTTCAGGAAAGCGAGACGAGCGTCCACCGCCACTGACCACTAA |
Protein: MQPRRQQQQSILSFLQKPPRDPAGAGEGTPPEKPPRPPAGSVAGIMERLVRPPPRPQPPQGSRNQDASQAGHFSGKTLPGRIRVPSDGHSSALSSGSWNGEYGRATMFPKQGSGIIPSQEPQKYPLRSSTDEFVQASSLVPEFGPNQTPLQARSLFEDFDVQTPSQVSSKKVFLGPAHGADTPLTESGSDRTHLQHSAKKFSLVSANDEYTRAATTFVLNSNDTRTEEHLNKLCPGSSDPLYIKATNLFAEFEANATPLKNHSKNSSLLMNDKHIGAAATIFPELDSSPLKPETPAMRAVIPRLKRVQEEQGVAANKPCSPLWVSNKKMKSANCSPIEKKDRDEMADSARRKFEWLNPSTIRDANRRRPDDPLYDKSTLFIPPDALRKMSTSQKQYWNIKCKYMDVVLFFKVGKFYELYELDAEIGQKELDWKMTVSGVGKCRQVGISESGIDAAADKLVARGYKVGRIEQMESANQAKARGSNAVIERKLLNVSTPSTAVDSNIGTDAVHLLALKEVTLSSSSSRVYGFAFLDYAALKIWVGSLHDDDSSAALGALLVQVSPREIIYETSGLSKETHKAIRKYASAGSVKMQLTPLPGIDFSDVSQIRMLIHSKEYFTASAESWLSALDCALNRDAIICALGGLIGHLTRLMLHDALKNGEVLSYHVYKTCLRMDGQTLVNLEIFSNNFDGGSSGTLYKHLNQCVTASGKRLLRRWICHPLKDIDAINKRLDVVEAFIQNCGLGPTTLGYLRKIPDLERLLGQVKSTVGLSSSIQLPFVGERILKKRIRTFIMLINGLRNGLDLLNDLQRADHGVSALYKVVEIPTLSSLHELIHQFEKRVQEEFPCYQDLGVEDSDGDTLALLVGLFVRKASEWSLVINAVSTIDVLRSFAAMTLSSFGTMCKPHILLKDDVPILRMKGLWHPYAFAESANGLVPNDLTLGQDLSGFNRFALLLTGPNMGGKSTMMRATCLTIVLAQLGCYVPCTSCELTLADSIFTRLGATDRIMSGESTFLVECTETASVLQNATEDSLVLLDELGRGTSTFDGYAIAYAVFRHLVEQVRCRLLFATHYHSLTKEFASHPHVSLQHMACMFRARSGAHDVNGEKELTFLYRLASGACPESYGLQVATMAGIPKSIVDKASVAGQAMRLKIAGNFKSSEERAAFSTQHEEWLRTAMSVIVKDGHLDEDIMDTLFCVCQELKFHFRKARRASTATDH |